AVOIDANCE OF CACHE SYNONYMS The invention includes a method of operating a branch 
The present invention relates to cache memories and *5L hT^/TF^* a ,T he mem0ry with a CAM 
methods of writing data such as tonrtiTSToSj ' Enfof fSfijS ? ^ C ° Upkd t0 3 

memory of a computer system 5 £££ , £ P 01 * 00 - ,0 P^ent a single branch 

5 instruction being stored as a multiple entry in said buffer 

BACKGROUND OF THE INVENTION - T method comprises selectively controlling validity of 

tv 0r more ^ es cache, invalidatino one line of the 

To run a computer program a computer repetitively < = ache - data including an address tosaid one uL * 

cames out a sequence of functions, typically fetehin. an the same time as an associate ooerafio iJrS ^ , ■ 

mstmcUonheldatagivenadoress.decotogtheins.rTction 10 the same address, and c^SSSt^SS^^St 

accessing an operand for use by the instruction, executing hne * dependence on the resu? o tfTa^Se oZtZ 

Ae Ruction, storing the result of the execution an! ««*y validating said line only if no cachS isS m 

determining the next instruction address. Problems can one embodiment said CAM is used to^toe addr« s IS 

occur where an instruction contains a test whose result rela ^S to a plurality of branch istrucuSs a^d s2d oa£ 

deternuaestteaddressofthenextinstrucUontobeexecuted. » P^ory portion is used to store for each oranS inSructtn 

An instruction of this type is known as a conditional jump. ? ** cache, a targe, address for Suction 

The consequence of the presence of a conditional jump °gcther with a prediction of whether the branch will be 

instruction is that the instruction typically has to pass ^ When f he instruction is executed by a processor said 

through several pipeline stages in a processor before the test , W " te Ration effecting a modification of at least one of 

is resolved, and before the next instruction to be fetched can *, . ***** address 01 Prediction in the event of a cache hit 

be determined with certainty. This can delay the pipeline . g said w-rite operation. Preferably said data memorv 

pr0CCSS - P portion also holds a prediction strength "value m adStion to 

, A program sequence may also include non-conditional !!^ £f! .^ 0Q - . and Said wri,e operation is effective to 

jump instructions. Such instructions, if executed, result in ,< said Prediction strength value in the event of a cache 

the program sequence jumping to a new instruction. S wnte Ration. 

For*epurposesof m epresentdesCTipuonandclaims.the JJl^J! na °\, inCludes a cache memoi y comprising a 

term "branch instrucuon" will be used to include both f addressablc a*™* (CAM) portion having a plu- 

conditional and non-conditional jump instructions J cou P led to a respective line of a data memory 

■\ In this specification the term instruction includes primi,^ E^J^™^ 0 * having an associa ' e . W ut for 

uve operations which may be included in a VLIW S?2 tin- " ^ ratioD and ; 

using Very Long Instruction Words. An insr^ctioTwS o" 2K£SSS. l ° ^ ' Wte 0perati ° D - Said 

sequence may therefore comprise a VOW instruction £ be ^ connected to input selection circuitry 

To reduce delay in deterging the JtEi* ~^E^Ztt52Z**l 

branch instructions may be loaded into a cache memory. 3S effected in paraUel ^STJS^SS^ST. * 

L u 2uSo a n s* buff « to ^ » * - ^«-i£55irs 

Tt « ,„ J- ~ 77 circuitry operable in response to a result of an associate 

a rfL«c w J FeSent inventi0D to avoid one °P. eratlon to P^ent data related to said common address 

addressable entry, such as a branch instruction, being M bcin S held in more than one line of said cachT 

entered outdoor more lines of the same cache. « The invention includes a branch target buffer" include a 

SUMMARY OF THE INVENTION 2TJ^TZ * f fcreald m wh5c h said cache includes 
•n. • te modification circuitry operable in response to a write 
The invention prov,des a method of loading entries into a fP^ 0 * to modify data stored for a branch instrucrion 
cache memory comprising a content addressable (CAM) 45 located bv a cai *e hit during a write operation. Preferably 
portion having a plurality of lines coupled to a respectire «*» data memory portion includes storage dS?£ 
line u a data memory portion, which method is character- stoim S a prediction strength value in addition to said ore 
isea ny effecting in parallel an associate operation and a for any branch instruction held in said cache and 
Z«l °?!l atl0n ™* thC Same address **• «> d usin g *e said «°rage circuitry is responsive to said data modification 
S!ZL to'** 0 *?* tocoatrol validation of the 50 c,rcui try to modify said prediction strength value in response 
S 1 "? 1 aad prevent a si °8l e addre * having . ?° a ^ ^"'on relating to a branch instruction ii said 
M J . same Preferably the method buffer - ^e invention includes a computer system compris- 
ududes selectively controlling validity of one or more lines a mem ory for holding a plurality of instructions a 
in ffie cache, invalidating one line of the cache, writing data Processor for executing a plurality of instructions seouen- 
inciuding an address to said one line at the same time as an SJ ^ ^d a branch target buffer as aforesaid for predictino 
associate operation is carried out using the same address. a ^S" ad dress for a branch instruction fetched from said 
and controlling re-validation of said one line in dependence me mory for execution by said processor 
on the result of the associate operation, thereby validating MTO 

said line only if no cache hit is found. BRIEF DESCRIPTION OF THE DRAWINGS 

. Ittonc embodimentinresponse to a cache hit in one CAM <so nG - 1 shows a simplified example of a series of instruc- 

7i7 s *wwi&&Mmmi>irpd^*lf^'>tto*i- .. •-•.•i :-. •• ■ . . ..... .. „ 

Et3 o c-P"^ 00 - Preferably an associate HG. 2 shows a partial schematic diagram of circuity' for : 
input or said CAM is connected to selection circuitry implementing the method of this invention- 

associate input. e . Ha 4 shows illustrative circuit diagram of read * 

circuitry of a partition of the branch target buffer of FIG. 3; 
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instructions of FIG 1- S 2£ 3 T a 'T- one conditional jump instruction, instruction 9. 

ftp, a V , , ?" e founf » instruction word 104 contains one unconditional 

JSkJJSZ tl CXamplC ° f UpdaUnS 1116 branch ^ instnJction - Action 12 and one conditional jump 

predicted values for an instruction; s instruction, instruction 14. The fifth instruction word con? 

FIG. 7 shows a second example of updating branch ns no j um P or conditional jump instructions, 

prediction values for an instruction; It will be appreciated that the program flow due t6 

FIG. 8 shows write circuitry for one line of a partition of executia € the instruction word 101 will always be the 

the branch target buffer of FIG. 3 connected to part of a f!^' 1 ameIy ""J?** the first instruction 4 of second 

computer system, and to instruction word 102. Executing the second instruction word 

• • conditional jump is effected), executing the second instruc- 

DESCRHTION OF THE PREFERRED *° &e fourth instruction word 104 (if the conditional 

EMBODIMENT ~ is -! um P at instruction 5 is effected) or executing the first 

FTP i • r instruction Oof first instruction, word 101 (if the conditional 

0-19 in \ ™ f SenCS ° f tWCnty ° ne " byte instructions jump at instruction 7 is effected). Similarly, the outcome of 

0-19 in a computer program. As seen in the figure, each executing the third instruction word 103 is either to execute 

SEZZT'"??'!* "? kMkh DUmbCr ( °- 19 > thC *" inStrUcU0D 12 0f fourth instruSo w^r? 04Se 
and by an address indicative of a storage location at which M conditional jump at instruction 9 is not effected) or executes 

b^™-vT ? T?* " d WWch " ^'""ted by the the second induction 13 of fourth insSSon woSSfS 
S,T, f° ° f the i n . sttuc « 0 nnumber.Theinstructions conditional jump instruction 9 is effected) FmaUy the 

inVtm^r; » ? S : 0f mese ^"P 5 h termed « executing instruction 16. which occurs if the first instruction 

mstruction word Each instruction word has a single word „ 12 of instruction word 104 has been executed o » *Z 
£££ ^^Sf** W ° rd 101 beifl S "000". that of " conditional jump instruction. 14 if not effected or H 

JJE C 1°? bCiD . g ?° r> aDd S ° ° D - ** ins,ruc - ahcrMtive Outcome " a '"urn * instrucSon 2 of fost 
tion has a byte address indicating its position within the instruction word 101 • 

S^-^S^^" ° f W °? ha$ byte , F ° r ° ptimum ** d of °P erati ^- *e computer, bavin* 
aoaress ,00 . the second • 01 -and so on. The majority of the - fetched a given instruction word for executioT <hn ,M «3 

*i ^ I ?• ° nt0 next "<l ue ntial instruction; instruction word containing the next inmcfoTxMAh 

these instructions are marked 'X'. Instruction 12 (marked required to be executed. However ft wmTseen fiSf * * 

reiche IE? ? Un ~f U ° nal jUmp ' beC3USe * execution f0 "S 0ifl S ** *• Menu^Tthe n«f i2«r SonTo 5 

S^S!tT 9 U Jt^ t0 « CXe r d ^ Va * * * e «~ insuu^on worn JonLn 

^ 14 J™** 1 CJ > 316 each tenned an y braach instruction. Take for example fourth instructbn 
condiuonal jumps. The next instruction to be executed after word 104: ^ instruction 

* urnca oy lnstrucuon 13. outcome of the conditional jump in instruction 14 

stoT e ST"; 4 ' instructions will be fetched from Continuing to consider the fourth instruction word 104 

SSSTStS 3"*°° 0 f « ched fast : and 1116,1 fetching a woog word (for exam P le - faa 22 

i^f^ , ° D - H °^ eVer - in CCrtain 55 101 fetehed 105163(1 of fifth instrucdon word 105) 

SSJSSi^f . Prosed to associate plural instruc- would result in the processing pipeline containing a numbJ 

toS£t£> a „^ q H CnCe K? 11 , ed " 1 - Stn,Cti ? n WOrdS ' 50 11131 ° f Wr °°S instructions which would of course oof£ 
olS fa£S! B a ^ Sab,e , Iocaa ° n mak6$ aVailable 1116 executed 7116 P ractical consequence of this would be that a 

SSn2ST • « W °f d fOT eX J C< i Uti0n - ^ * C preSCDt DUJnber 0f C y cles wou,d be ' ost - dur h.g which no usefii 
dwcrffiuon^ term in^rycuon word is not intended to be « execution took place; only on a subsequent correct cvcle 

oftfn^ U Used *> r *™>'c to «fer to a plurality " -WOuM^cor^ttfW^ • 
fl^Tf • 01 ° ne Y 1 ^ ins^ction. which may be after being called up into the pipeline. 

T^^Z S -TS e addK ? ^ St0K - Asimplmeddevic^iUustratuig someof the featuresof the 

fir^Wn!^ ^insfcuon words shown in FIG. 1. the present invention will now be described with respect to FIG 
first instruction word 101 contains no branch instructions ie 65 2. • ^ 

jUn * S ^ 00 uncond itional jumps. Tfie Referring to FIG. 2. the device" comprises memory cir- 
second mstrucuon word 102 contams two conditional jump cuitry 20 0 which consists of a memory *Ly Saving a 
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plurality of addressable locations and an address decoder 
210 for addressing the memory array 205. The address 
decoder 210 has an input 201 over which it receives instruc- 
tion addresses within a single instruction word. The address 
decoder produces address outputs corresponding to the input 5 
instruction address and all later instruction addresses up to 
the end of the relevant instruction word. The number n of 
storage locations in the memory array 205 is at least equal 
to the number of instructions in the program sequence 
currently being run. The memory locations of the array 205 10 
store at the address of respective instructions, information 
showing whether the instruction is a branch instruction or 
not. i.e. a logical 0 where the instruction is not a branch 
instruction, and a logical 1 if the instruction is a branch 
instruction. Each memory location has a respective output 15 
line 220J-220,,. The output lines 220 A -220 rt are coupled to 
a register arrangement 250. having storage locations corre- 
sponding to each of the lines 220 L -22Q n . for storage of 
branch prediction values. Each register location has a 
respective input formed by one of the lines 220 A -220 n and 2 o 
a respective output line 280^-280,,. The register locations 
store logical 1 where an associated. branch instruction is 
predicted as taken, and a logical 0 where an associated 
branch, instruction is predicted as not taken. When an 
instruction address is input over input lines 201. the address 2 5 
decoder 210 addresses the corresponding locations, of the. 
array 205 and. where an instruction at a corresponding 
address is a branch instruction, there will be a logical 1 
output on the corresponding one of the output lines 
220 r 220 :J , which lh: tyjn accesses...the register .250 and 30 
produces 1 " off- an output line 280; either a corresponding 
logical 1 if the branch is predicted as taken, or a correspond- 
ing logical 0 if the branch is predicted as not taken. Where 
one or more of the addressed locations is not a branch 
instruction, logical zeros will be output over the correspond- 35 
ing output lines 220^220^ 

The logic stage stored in register location corresponding 
to non-branch instructions is not significant because no 
logical 1 can occur on a line 220 A -220 n unless the associated 
instruction is a branch instruction. As a result the output for 40 
each location which corresponds to a non-branch instruction 
will always be logical 0. The output lines 280^280,, of the 
register 250 form word lines to a store 300 which stores 
target addresses of branch instructions. The store 300 has a 
first address output 301 at which the store 300 delivers the 45 
target address of a branch instruction, and a second output 
302 which provides a logical 1 when any branch instruction 
is predicted as taken. The store 300 has one row 300 1 -300 rt 
for each word line 280 A -280„, and each row contains 
memory cells connected to the output lines 301 so that 
application of a logical 1 to one of the word lines 280,-280,, 
produces an address on output lines 301. Store circuitry 300 
also contains gating circuitry having an output to the second 
output line 302 and producing a logical 1 at output line 302 
when a branch is predicted as taken. The store circuitry 300 
further contains decision circuitry which provides only the 
target address of the first jump instruction from the word 
which is predicted as taken. 

Before operating the device of FIG. 2. it is initialised by 
sequentially addressing the memory array 205 and.storing a 
logical 1 at the address locations of memory array 205' which 
correspond to branch instructions. The register circuitry 250 
is loaded, during the initialising stage, with prediction 
information indicating whether or not the associated branch 
instructions are predicted taken. The store circuitry 300 is 
loaded with address information- corresponding to the target 
addresses of the branch instructions stored in memory array 
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205. The circuitry necessary for effecting the initialisation of 
the circuitry of FIG. 2 is not shown. 

Once the initialisation has been effected, the program 
sequence can be run. To do this, the address of the~first 
instruction, instruction 0 of first instruction word 101 is fed 
to. input 201 to memory circuitry 200. This causes address- 
ing of the memory array 205 .at all locations corresponding 
to instructions in the first word. Because the first instruction * 
word 101 contains no jump instructions, no logical 1 will be 
output Hence the output lines 220 A -220„ will all carry 
logical zero, and these logical zeros are applied to the 
register 250. The result of applying all logical zeros to the 
inputs of the register store 250 is to provide on output lines 
280,-280,, outputs comprising all logical zeros. The store 
circuitry 300 receives the logical zeros and provides an 
output of logical zero at the first output 301 and an output of 
logical zero at the second output 302. 

In the present example, the first instruction word 101 
contains no branches and thus no target address is output at 
output 302. Accordingly execution proceeds with the fetch- 
ing of the second word 102. 

Accordingly the next instruction word to be fetched is the 
second instruction word 102. For the second instruction 
word 102. the memory circuitry stores a logical 1 at the 
address corresponding to instruction 5 at byte position 2 the 
byte positions in each word being designated 1 to 4) of that 
instruction word and a logical 1 at the address corresponding 
to instruction 7. at byte position 4 of that instruction word. 
Logical one is output from the memory circuitry over the 
output "lme 220£-2207correTpo¥dmg 
and over the output line 220 A -220 n corresponding to the 
fourth position in the second instruction word 102. These 
logical one inputs are provided to the register 250 which in 
turn provides logical one outputs over those register output 
lines 280 x -280 n which correspond to branch instructions 
which are predicted as to be taken. Thus, the one of the 
register output lines 280 A -280 n corresponding to the second 
position of the second instruction word 102 will carry a 
logical one if it is predicted that the conditional jump 
instruction 5 will be taken and the line corresponding to the 
fourth instruction position of second instruction word 102 
will carry a logical one if it is predicted that the conditional 
jump instruction 7 is effected. 

These logical one inputs are provided to the store circuitry 
300 so as to read the target .addresses of the predicted-taken 
branches and the decision circuitry outputs the target address 
of the first occurring predicted-taken branch instruction. 
This is because if, for example, conditional jump instruction 
50 5 were predicted as taken, instructions 6 and 7 cannot be 
executed if the prediction is correct Thus the earliest 
c>ccurring predicted-taken branch in an instruction word 
determines the next instruction to be fetched. 
The target address is used by the processor of the 
55 computer, to cause a new instruction word to be fetched 
containing the instruction at which execution is predicted to 
proceed- 
In this simplified embodiment as mentioned previously, 
where execution of the instructions in an instruction word 
60 starts other than .at the first instruction of that word, the 
address of the mitiai insthi ction is input over inputs 28i -and*.' 
the address decoder 210 only applies addresses correspond- 
ing to the re mainin g instructions of the word to the memory 
array 205. Thus, for example described with respect to FIG. 
65 l. if execution of the fourth instruction word 104 were to, 
. commence at instruction number 13, (for example in 
response to the conditional jump at instruction 9), the 
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memory array output ikes 22D.-220 mt ?, ? e b " ffe f / word addre « input bus 900 

. _ . , . . . . "* is provided to write word addresses into the CAM arrays a 

The above ^description is intended to illustrate the con- target address input 908 is provided to write into the data 

cepts of this invention. One embodiment of circuitry for RAMs target addresses for the branch instructions which are 

carrying out the method of the present invention, this w newly written into the buffer. As will be later described, anv 

circm^ including a branch target buffer, will now be write operation into the buffer is carried out simultaneity 

described with reference to FIGS. 3-». with a read operation using the same word Jfi 322 

Referring firstly to FIG. 3. a branch target buffer 400 900 as a read ^P^t 500 together with the appropriate byte 

consists of four generally similar sections 401-404, each add ress 501 to avoid writing into the buffer any instruction 

referred to hereinafter as a partition. W which is already held in the buffer. Other inputs and outputs 

The number of partitions of the present embodiment m shown in FIG. 3 and their function will be described with 

corresponds to the number of instructions per instruction referenc e to the more detailed drawings of FIGS. 4, 8 and 9. 

word; and in the example described above with reference to As wiil seen with reference to FIG. 3. each of the 

FIG. 1. this number is 4. It will of course be understood that Partitions 401^104 is substantially similar and therefore a 

other numbers could be used according to the content of the 20 deta iled description of one exemplary partition 401 only will 

instruction word; specifically in a more complex system 8 be 8 iven - 

KEff ^ i ^ Cd * lt WlU • alS0 * a PP feciated ^ °* e Refe ™g t0 HO- 4. partition 401 consists of n content 

skilled in he art that fewer partitions could be provided. For addressable memory cells 510,-510. coupled to Ve SS 

SStt nZ U 5 ^ * ** f0Ur - bytC eXample 0f 25 • ti0n Word a ^ss input bus 500. having a ta p^tf 
HG. l to only provide two partitions, one corresponding to 25 CAM output lines 511,-511 . As will be late tocrihS 
the first two bytes of each instruction word, and the oLr herein. the 'present embodimen rftli^it^ 
corresponding to the second two bytes of each instruction operated dynamically, in the sense that once S of Snes 

except one contain data the partition is regarded as "full" 
~ In, the pxesent example.. first partition 401. stores data - The one unfilled line is retained for writing a new entry and 
relevant -to. branch, instructions which are located at the first ' as part of the write operalionT'deails'of one branch Instruc ~ 
byte of a word stored in the buffer, second partition 402 ^on stored in the branch target buffer are discarded" so as to 
stores data relevant to the branch instructions at second byte be rcad y f °' iaput-of a next newly-found branch instruction 
locations and so on. First partition 401 is referred to herein A bra och instruction is said to be "newly-found" if it does 
as the lowest partition, and fourth partition 404 as the „ not e *ist in the branch target buffer at the time of testing for 
highest partition. presence of the branch in the buffer. It will be noted that a 

Each partition of the buffer 400 includes a CAM array branch instruction which was discarded from the branch 
holding a plurality of word addresses and associated data tar 2 et buffer m 0 ™ cycle of operation may become a 
RAMs holding target addresses for branch instructions "newly-found" instruction during a later cycle of operation, 
located at the word addresses stored in the CAMs. The buffer 40 Thus * whereas increasing the number n of content address- 
has first and second input buses 500.501 giving the address able mem °ry cells increases the complexity and size of the 
of an instruction being fetched. The first input bus 500 is an device - such an increase tends to reduce the number of 
instruction word address bus receiving the most significant occasions on which a branch instruction needs to be written 
bits of the instruction address and the second input bus 501 m ' 

is ;an instruction byte address bus for the least significant bits 45 Decreasing the number n of memory cells provides a 
of the address of the instruction being fetched. In the smaller and simpler device, but with the penalty that the 
example of FIG. 1, the first three bits of the instruction chances of failing to find a branch instruction are increased, 
address form the instruction word address and the lowest Where a jump instruction is not found this tends to lead to 
two bits form the instruction byte address. An input on bus a processing delay. 

SSrS,! T"**? 0pCr f ° D 10 f Ct f miDe tf 50 10 fte P fescntI y described e«*odim«t. content address- 
ee buffer holds data corresponding to the word address of able memory cells 510,-510,, are capable of storing Ae 

SLSSSZSl SSL'S!?- The !V 501 b used 10 ^"^^^'im^MSSiiZ 

controlthe output of theCAMairays during a read operation 500. As will be described later herein, certain instruction 
w? ,1 °f >eratic>n which P revents a" 1 !*" w ord addresses are stored in the content addressable 

SnT^HW^l corresponding to a byte location 5J memory cells. The instruction word address of the instruc- 
SVSL fv * " e i"** 0 * mpUt 0n UBe 501 tioa being fetched is input over the instruction 

m P ut 00 50 . 1 « S^ter than the partition word address bus 500 and when an instruction word address 
2^52" f ■ V ° per . a 1 tt0n .I n fte ou ffer 400 has a "t for corresponding to one of the stored addresses is input a 
the relevant insertion idenufied by the inputs 500 and 501. logical 1 occurs on the corresponding one of the CAM 
*.M? u ?! r P ro ^ d « AW address output on bus .507. an . M output lines 511.-511 

output S(a:^i^%'mi % hiinyi£g-'M(»i^^-it^S&^'-- '" , ^T^^ v l%ii-WKtV '«..»"•• '• "' . * . 

wmVhpartitionscontambranchms^ctionsfromthe^nl in< 3 M ^ I ™ T? * 
instruction n/orrf u/h.vh " jTi nomine current instruction byte address bus 501 and is connected to the 

StSSS Motion S ^ » A" CAM output lines 511,-Sl^ so as to selectively disabled 

*LS£i 2K32. P **» ofapartition. The selection circuitrV 512 has a 

it -wilt fv nn/iwrtl^ . v « prejecssing circuit 560 receiving one input from the instruc- 

a ,i, , 0e uoders tood that when a processor outputs an tiori byte address bus 501 and having an output 561 con- '" 

address to memory to fetch a new instruction, that address nected to the control inputs of a plurality of pull down 
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511, -511, and earth. During «^» . conareTEut ftS SM fi.St « " ™ P T*^ P rocessin S circuitry 

- K :r \ to t skss£s2S£ 5 se? jarr* ^ ™ s laRer is dcscribed ^ 

tSSti £ ^^ffiS^JSSS o A ut RCfeninS ; ° h nG - 3 - «* « — — above, the 

these partitions wh«e the storea branc , i£i5£ta * £T!!5f ^ StniCtures t0 11131 of * e 

byte location before the byte position KfaEfSXeXl Itt 4 ° L 7" ' each P 3 ** 03 which « 

The selection circuitry 512 has output l£es 513 I|l3 10 S " tulaE I t0 * ose , of P 3 ""™ 401 have similar reference 

which form inputs to%redictio?p?oc«ia7 dJEjai Zlnt^tT' T^-" ^ 4 °* being in 

The prediction processing circuitry 514 includes an * 00 799 ^f^nf 3 being 10 thei3n S e 

n-location store 563 for storing prediction information for IZ'lll th ° Se ° f parUUon 404 * *e range 

indicating whether an associated branch instruction is pre- 8U <™- . 

dieted taken or not taken. Each of the store locations 13 . M P reviouslv Ascribed, between each respective parti- 

563,-563„ corresponds to arespective CAM cell, and has an ^ 311(1 &e ad J' acenl partition there is a first transmission 

output connected to one input of a respective two input AND 8 ate 531 - 631 - 731 - 831 and a second set of transmission 

gate 564,-564,,. the other input of which is provided by a 8 ates 53 °- 63 °- 830. Both the first transmission o a te and 

respective output line 513,-513. of the selection circuitry second set of transmission gates are controlled by the 

512. As shown in FIG. 8. an OR gate 565 is connected 20 out P u ' "nes 517. 617. 717. 817 of the respective NOR eate 

*"«P^-"torelocations563andtherespective 516. 616. 716. 816 of the associated partition. In any one 

Aivu gate 5M,-»64„. but as this forms a direct connection partition, the operation is such that when the respective NOR 

S 1^ n v herein - At P redlction store 563 «°res a inputs, the responding output line 517 617 717 817 

output of NOR gate 516 is connected to OR gate 590 which tw, ~u,;„„ t« k i. • 

is omitted for clarity in FIG. 4 as it has no effect durin* a .J? 1 ™ *"» t0 bfa f Ch ™«™*«» 316 * a parti- 

read operation as the signal on line 920 is 0 durinoaTead ?°? ^corresponds to *e byte position of the branch 

During a write, the gate 590 in each partition receives a 4? ^^^°^^^cuonword For example.data relating 

signal 1 on line 920. receives a to a branch instruction in the first byte position of any word 

Data bam ^nc.-cc ~t ... 1S stored in partition 401. data relating to a branch 

Sift /sir tf ge ^ - s^js^^i^j-*: 

mation is made available on tatgeTa^e^u^usIS K^C^^SS"^^ ,0 inStn,C , t i° nS L 

for isolating partitions storing data 35 relating to the prediction as to whether the brancfa Ts S 

tedM tem 5 Inn SLT" W ° rd aS ,-« m ^ late iS St0red 13 toe P 1 ^ 00 P r ^S circuit audita 

506 hi a ^ " d 10 cn^ C ampMer Ckcuitr y relati °g «o ^ '"get address of the branch instruction in the 

bait 903 usiT ! 3ddre$S . f 0U, P ut 507 3 °? 3 t3r S et ad ^ RAM of the partkion. In the presently described Sample 

Sfrfl lomlfn^f, 8 ? PCraU0D - ^ u bUS , 5 ° 3 f0lmS the ^ which presence of branch inSon 

'iS^SSS^S^Sf^SS^ 60 is the fuU instruction word address of the word containing 

Id iSn^SmSS . *^ < ^ R ^ I f. t «Stlected ^ eria^y ftfc branch instruction As will belati described herem.:S:- 

8 1D 1 ^ P3,hS betWee ° Possibletostoreonlyapartofmeinstructionwordaddress! 

Th«- hr an ^ „ u » ■ . The 4313 chosen for iDd 'ca t i n g the presence of branch 

strath n^T W" hl * r -J2f 0 eonsists of Prediction- instructions is stored in the content addressable memory 

strength.processing circuitry .550. . . • .. «, ceUs 0 f the respective partition. 

a , fire?' , U °f" Strength Pf oces / iD S ^ary 550 receives. The location of. data identifying branch instruaionT'li' 

as first inputs, the output lines 513,-513 n of the selection shown in FIG. 5 for the FIG. 1 example. Referring to FIG. 
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5. it will be seen that cells 510 of the first partition 401 not. and plural AND gates 564 The AND gates receive one 

contain an entry of ' Oil" because the instruction word 104 input from the associated selector circuit^ and one from a 

(having address pil ) has an unconditional jump instruc- respective entry of the register. Thus, considering instruction 

tion in the first byte position. Similarly the second and third No 7 of FIG. 1. the prediction processing circuit 814 of the 

words . 10 2-10.3 (word addresses "001" and s fourth partition will either produce a logical 1 from a 

010 ) have conditional jump instructions at the second byte respective AND gate on one of the output tines 815 fif the 

position- of the instruction words, namely the positions branch instruction 7 is predicted as taken) or will produce all 

corresponding to instructions 5 and 9. Thus, the cells 610 of logical 0's.on the output tines 815 (if the branch KSon 

second partition 402 store word addresses "001" and "010" 7 is predicted as not taken) instruction 

of words 102 and 103. Entries are similarly made in the third c„ r ,•,..„,• , 

and in the fourth partiuon (coiresponding to instruction 7). nn ,s» „.„,„,«„ eie „, u - «. ''T . a 10 3 lcai l 

itiii r^ m -n 0f r,Trrr « •„ ,u . ♦ % .u • _ " 0D ^ ou $> ut UDe 815 wl uch corresponds to the position of 

Still referring , to FIG. 5 in the context of the instructions the branch instruction information stored in the content 

shown in FIG. 1 an input is made to the instruction word addressable memory 810 causes: 

address bus 500 of the word address of the instruction , -ru, niltnilf rtf v,oo <,»,» «i« ^ ~~ . ■ • , a , 

currently being fetched; this address is applied by the bus to " , 3? ?Ut ° f 83,6 ?* * *° ,0 k * rtI 0 and - 

the CAM cells 510. 610. 710. 810 of {he whole device. 2 " ™f con f row * *e data RAM 818 to be 

Where a match occurs between the word addresses input to it out P utUD 8 onto the address out- 

the bus 500 and an address stored in a CAM cell, a logical P f A buS ^ addres . s " 0000 °" Le " '«« ^S" 

one output (referred to as a "CAM hit") is provided bv the M n J*?* 1?' *T • "f^ 0 "-. . . ' 

relevant output tine of the CAM. If for example the word 1- in * c remainiB g- first-third, 

address "000". corresponding to word IMis » Pf^ 0 ^ 401-403. there will be ho logical 1's at the output 

matches occur in any partition and no CAM hits will occur. funTZTZ ,? JH^T Ef^* ^ 514 

However, when the word address "001" of the second t^'^l ° ^ S1 ?' * U ' ™ WiU haVe 3 

instruction word 102 is input over bus 500. a match occurs „ ^ 1 ^"c,^™ C0lres P 0ndin S ttans - 

in the CAM cells 610 of the second partition (corresponding * ZT % $ 53 °f° ? C '' °^f 31 ^« Active and 

to instruction 5-see FIG. 1) and in the CAM ceU?810 of * f 6 corresponding lines of the predicted-taken bus 505 

the fourth partition (for instruction 7-see FIG. 1). The J^S^J^Z**]^ T 15 **** from * e 

result of these matches is to produce CAM hits on the CAM [T? ^ U °^ f 04 ",? 13 £™ 5 ??™ tpat fr0m partition 

Outpullmes61i;811associated.withtheCAMcellswhere „ " 5 * V * v • 

the hit occurred. •"" - a second example, assume that fourth instruction 

rin^tri.rtJnn it ttt^ in To av *u T j stored 111 the branch buSei. the branch target buffer 

Vk. . „. . ' ... because that instrucUon will not be executed in sequence 

The selection circuitry 512. 612. 712, 812 includes 45 with instruction 12. The branch target buffer takes this into 

respective processing circuitry 560 (see FIG. 4) which acts account because the NOR gate 516 of the first partition 

during reading to disable any partitions which correspond to provides a logical 0 output over its output line 517 to render 

bytes lower than the byte address input on instruction byte non-conductive the transmission gates 530. 531. thus iso- 

address inpu bus S01. Thus for an input byte address of lating the second-fourth partitions 402. 403. 404. The second 

00 no partitions will be disabled, and all selection circuits so set of transmission gates 530 prevents the output of .target 

will pass any hits from input 511.611 etc to output 513.613 data from the data RAMs of the other partitions and. as will 

etc. If the instruction byte address is "01" then selection later be described herein, the first transmission gate 531 

circuitry 512 of the first partition 401 will not pass any hit prevents the prediction strength of predicted unexecuted 

from an input line 511^511,, to an output line S^-Stf,,. branch instructions from being updated. Although this pri- 

whereas in other partitions, any hit wiU be passed from input 5 5 oritising feature has been described for the unconditional 

to output If the instruction byte address were "10" then any branch instruction 12. it should be noted that the branch 

hit would, only be passed by selection circuits 712 and 812 target buffer is not aware that 12 is different to any other 

in the third and fourth partitions, and so on. In the present predicted-taken branch. The buffer will therefore treat any 

case, the instruction byte address is "10". thus enabling only other predicted-taken branch in the same way. i.e. act to 

selection circuitry 712 and 812 to pass any hit. However, the p exclude any output foe Jatet. branches, in the same word, 
only hit which occurs is in the fourth 'partition ^ and' 'this is ' " - ^ 

therefore allowed to proceed as an input to the prediction tion word 102 is to commence from the first instruction of 

processing circuitry 814 of the fourth partition. that word (induction 4 of FIG. 1. having address "00100"). 

It will be recalled that the prediction processing circuiuy . Assume for the puipose of this example that instruction 5 - 

514.614.714 and 814 each consists of respective prediction 6J (00101) is not predicted as taken and that- instruction 7 

store 563 which contains a logical 1 or logical 0 indicating (00111) is predicted taken, the instruction word address is 

whether a respective branch is predicted as being taken or input over instruction word address input bus 500. which 
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will produce : logical 1 CAM "hits" in the second and fourth SK^S^. The pull down transistors 573.-573 are con- 

pamuons 402. 404 The byte address 'Wis input over nected to a corresponding entry 571,-571 of the prei Son 

dSSZ^f^&fJ!" rt 5 v l ° f s tt engu,store571 P Theo^u7oftheAND%ates5 P C-?70 

ZTi J™* a'a ° f *l P T u° RS ; A - ^ e ? U0 ? sd form inputs to an n-input OR gate 574 havin* output 504 

Jit? address causes each of the selection circuito- 5 which includes an output prediction sttengfcline for each 

to pass hits from input to output to provide conespondins partition >"™ 0 m unc ror eacn 

logical l's to Ae prediction processing circuitry. In the , ^ opCTation . when a logical , Qa Qne f ft . 

presently descnbed example, such logical l's occur in the | ine? -fit r;„,w;™ *.» u u - . ? 

second and fourth partitions 402 and 404. In the second £ • ^ in f. tr r i ° D h * 5 

partition 402. the relevant register of the prediction process- l0 £ B j£ £j U °? 0 *?ES" V* L <* to °«f * 

ing circuitry stores a "not-taken" prediction, and as a result , . v 1 !?* ^ pr ° duCe 

all of the outputs of the prediction process^ circuit 614 !S °, ut P ut fK 0 ^ contending stage of the 

remain logical 0. indicating no branch jSfeSSS. Bv GS^SfftSS^SZ § "V* '^T*' 

x tv. .j , . . ~ applied to one input of a respective one of the AND oate* 

a) Provides a logical 0 at the output of the associated NOR 572,-572,,. 

gate 816; and Xhc Q J put of the AND ga(e 572 -573 will be at logical 

b) causes the data RAM 818 of the fourth partition to 20 0 unless the update enable line 502 is at logical 1 which 
output a target address of "00000". As none of the causes an automatic update of prediction strength to an 

lower partitions has a non-conductive transmission interim new value. In this event, one of the AND *ates 

gate, this target address is passed through to the target 572,-572, will have a logical 1 output, which causes* the 

address output bus 503. associated one of the pull down transistors 573,-573 toturn 

As a result it can be seen that the branch target buffer 25 on. pulling the corresponding stage 571.-571 of the ore- 

described above identifies only the first predicted-taken diction store 571 to be pulled to logical 0. thus" causing the 

branch instruction of a sequence, which is not excluded for prediction value stored in that stage to change to logical 0 

execution by being prior to the first instruction of the indicating a strong prediction or remain at logical 0 if it was 

sequence to be executed, and, more specifically, the target already in that state 

.address of. that instruction.. _ .As previously, noted, there is '-a. respective transmission 

As previously, discussed with respect to FIG. 4. each gate 531,631 etc'connected ih'tfae update enable line 50*> 

partition of the branch target buffer includes circuitry 550 between each partition and the next higher partition It will 

known herein as prediction strength processing circuitry for be recalled that this transmission gate is conductive unless 

storing information based on the history of the branch the relevant partition has identified that a branch instruction 

mstructions identified in the corresponding partition. As 35 is predicted taken. In that event, the transmission gate is 

previously noted FIG. 4 represents an exemplary partition rendered nonconductive during the operating cycle by the 

401 and the prediction strength processing circuitry 550 has respective line 517, 617, 717. 817 going to logical 0 At the 

counterparts 650, 750, 850 in the other partitions. start of each cycle, the transmission gates ar°e conductive 

^ Prediction strength processing circuitry 550 receives and a logical zero is applied via the line 502 to all partitions 

«£ UtS c^i 0m CaCh ° f ^ selection circuitry output lines 40 as a precharge leveL The logical zero is then disconnected 

513^513,,. A logical 1 will occur on one of those output but the line remains at that leveL Once transmission gates 

lines 513,-513,, when a match occurs between the word 531. 631 have gone non-conductive in partitions where a 

address input on bus 500 and data indicating the presence of branch is predicted taken, a logical one is applied to the line 

ciActn mstruction storcd ta ohe of ce *k 502. The consequence of this is that a logical Ion the update 

510,-510,,, provided the selector circuitry 512 has not 45 enable line is input to each partition in ascending order up 

disabled the partition because the instruction in that partition to and including any partition in which a branch instruction 

is pnor to a first executed instruction of the relevant is predicted as taken. Later partitions, regardless of whether 

sequence. Each of the inputs provides a first input to a or not they contain predicted-taken branch instructions do 

respective AND gate 570 A -570 n . The other input to each . not receive the logical one level needed to update the 

AND gate is derived from a respective entry 571,-571,, of 50 prediction and thus are not automatically updated, 

a store 571 in the prediction strength processing circuitry If during a fetch operation an instruction word is reco*- 

which stores information indicative of whether there is nised as having plural branch instructions, it is desirable that 

associated with the corresponding instruction a so-called the above-discussed automatic updating take place for all 

"weak" prediction or a so-called "strong" prediction. In the those branch instructions which are not excluded from 

present embodiment, a weak prediction is represented by a 55 execution by virtue only of the initial execution point within 

logical 1 stored in the corresponding stage, and a strong the word, up to and including the first predicted-taken 

prediction is a logical 0 stored in the corresponding stage. branch. As an example, if ail of the partitions 401-404 stored 

A strong prediction indicates that a high degree of con- a jump instruction for a particular instruction word and if 

fidence that the presently stored prediction is correct, and execution were to start from the branch instruction in 

r ft^fQre w uiu^ely lo.be. changed. whereas a weak prediction 60 partition 402 (predicted not-taken) and the instruction in 

indicates' a lower degree of confidence- in the correctness of "partition'403 wer^ pfeditfed 1 

me present prediction, and a greater likelihood of change. performed on the predictions stored in partition 401 

The lines 513 A -513 a also form one input to respective (because this instruction could not be executed) and no 

AND gates 572,-572*. The other inputs to the AND gates update should be performed upon the instruction represented 

. lT " 5 . 72 " *t -Provided by the update enable line 502. 65 .by partition 404 (because the instruction represented by 

which is common to all those gates.The output of each AND partition 403 is predicted as taken, thus preempfrgany ' 

gate 572 A -572„ controls a respective pull down transistor judgements on the instruction represented by partition 404). 
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h,.Sf i„ a! f T ® " P f ° r m ^ braDCh tarSCt inst ™<*°°- i* the start of the cycle C, Similarly C 

buffer in the following way: represents the second occasion at which the branchSuc 

Since in this example the byte address input over byte tion is fetched, and the resolved prediction faTeSTc 

a m V^Jf bUS 50 - CaUS " * e SdeCti0D C ' UCUitly 312 t0 re P resents * e P"di«ed value for cycle C and sTon In 

f^ttfV** S?^ 40L r m ^ n ° l0giCal rS ° n 5 each * e P^ioa is updated 5o p^i" aTinte'r£ 

any of the hnes 513. Thus no logical ones will be applied to update by assuming that the prediction * fcfsJit of Te 

Aepredicuon strength circuitry 550 and the output of the - cycle was correct J execution^ the nitron nows tfiS 

NOR gate 516 remains at logical 1. Gates 530531 remain the prediction was in fact wrong, the interim update 1 

conducuve Although there will be a CAM hit in the second discarded, and a new prediction rn^de. To£ ™ P Son 

partition 402. and the selection circuitry 612 allows the 10 is based upon the previous prediction. « £ o fZ 

corresponding logical 1 to be applied to the prediction cycle, and the knowledge gamed durm* reSuUoT 

processing circuitry .614. the 'not-taken' prediction has the In the presently-described embodSt Sch instruc 

consequence of providing all logical 0's at the lines 615. tion is put in the BTB only when it is executed t o caSe a 

Thus NOR gate 616 has a logical 1 output and gates 630.631 jump from the normal sequence. Thus at S^f the en^ 

the third partition 403 causes the NOR gate 716 of that T" Oaken once) ""uucuon is 

partition to provide a logical 0 output, thus rendering non- Referring to FIG. 6 the progress of a jump instruction 

conducuve the transmission gate 731. Hence the update line which is initially correctly-predicted will bcTsS 

SL JSZ fl!° P 0 ^ s401 ^ 03 - ^ ^connected At the start of cycle C,. L prediction iT^yLn" 

tZZJZSZ? IV- N ° • P ^ tmS m r^ 00 401 ° CCUrS 20 aDd il is assumed ^ * e bl ™ h **S« ^« mat the jump 

because alogical lis required on one ofthehnes 513 for this will in fact be taken. Thus the prediction state is updated^ 

-J; 1 ^ . ... ,. , . At update enable line 502 and the automatic update circuitrv 

Theprediction strength information, as mentioned above. 572. 573 etc to "sttongly-taken". In this casTthe jump h 

EE? 7? ^ 0D ******* ° f ideDtified ^^^^^cmtbc-ln^SSS^ 

Smce the prediction of the outcome of a branch-i.e. cycle which includes fetching and executing the instruction 

Ztof^^7^T iS ^ hr ^^ At the Start ° f <* cle theprediction S^S^SS 

time of fetching, rather than executing, the branch and the jump is resolved as "taken". The interim update 

instrucuon. it is necessary to update the prediction for use prediction remains strongly taken and. as the jurnTis 
:Jhe next time that particular jump instruction is fetched.. 30 -resolved as being, taken, the resolved prediction is likewise 

depending on whether or not the prediction currently being "strongly taken* ' ' 

made is found to be correct or not during actual execution. However, if in" cycle C, the prediction is incorrect in that 

Thestra^gyadoptedm&epresently-describedembodiment the jump is resolved as being not taken then SffouSS 

is to update automatically to an interim new prediction on applies: 8 

tte assumption that the present prediction is. in fact, correct. 35 At a start of cycle C 3 the resolved prediction was 

K the intenm prediction is found to have been correct when "strongly taken" and the interim update is thus "strongly 

fte branch is resolved, then the interim prediction is retained taken". However, as the jump is resolved as "not-taken" to 

as the new prediction for the next execution, referred to prediction requires updating to "weakly taken" as shown 

herein as the resolved prediction; only if the present predic- Thus it will be seen that the strength not the prediction is 

tion is found to have been incorrect is there a need for 40 changed. 

correction and a corrected 'resolved prediction' is stored For the next cycle. C 4 . the prediction value is still "taken" 

The branch target buffer defaults to a state in which no although "weakly-taken". Thus the interim update will be 

branch is predicted taken for any cycle in which corrections from "weakly-taken" to "strongly-taken". on the assumption 

are being undertaken. This enables the branch target buffer that the predicted behaviour is correct. If however once 

to be implemented as a single-port device, the single port 45 again the jump is resolved as being not-taken, the resolved 

being alternatively used for reading out of predictions and prediction must be corrected to "weakly-not-taken" in other 

£SST (4 7 • , « • WOrds ~™*» * e Prcdictiofof 

Referring to FIGS. 6 and 7. the process of updating the cycle C, to the resolved prediction of cycle C 4 requires the 

predichon and prediction strength will be described. Both prediction to be changed, rather than the siengto to be 

i-l(j6.6and7showthehistoryofabranchinstruction which 50 changed i.e. from "weakly-taken" to "weakfy-not-taken" 

centered into the branch target buffer during an entry cycle This change is made by an associative look-up in the present ■ 

(b). For this explanation, it is assumed that each time the embodiment. 

branch instruction is fetched, the previous execution of the Finally at the start of cycle C, the prediction value is 

instrucuon has been resolved In practice, it may be possible "weakly not taken", and accordingly the "interim update" is 

for the instruction to be fetched again before a previous 55 to "strongly not taken". If the jump is resolved as "not taken" 

execution has been completed, as will be later described the resolved prediction is "strongly not taken" whereas if the 

herein. On entry, the prediction and prediction-strength for jump is resolved as taken, then the resolved prediction 

any newly-entered jump instruction is "weakly-taken" (wT). would be "weakly taken". 

The prediction and strength are stored as a logical 1 Turning to FIG. 7. the progress of a jump instruction is 
SP^M^.P'f^M^ S63. : 663. I63...fi0. shown, in which the first resolution of the jump instruction 

863 and as logical 1 (indicating "weak") in the prediction after entry into the branch target buffer is intt**ecX^u£in r 

strength store 591 etc. to form, the prediction for that cycle C t the interim update, assuming that the prediction 

instrucuon rwhen it is executed next time. value of "weakly taken" is correct, is to "strongly-taken". 

to both.FIGS. 6 and 7. cycle Q represents the next cycle However, the jump is resolved as "not-taken". As a result. 

an-which-thepresenUy-consideredinstruction is fetched, and 65 the resolved prediction is ''weakly-npt-taken". in other' 

the resolved prediction for the previous cycle represents the words requiring theprediction at the start of the cycle! which" 

predicuon state at the time of next fetching of the was "weakly taken" to be corrected to "weakly not taken". 
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At the start of cycle C, the 'Veakly not taken" prediction via a second transmission gate 910, The output of the 

gives an interim update of "strongly not taken", and the second OR gate 906. provides the control Xu zo me 

resolution of the jump as "not taken" confirms this value as transmission gates 918, and 910,. The second St to tl 

ttZ 3 H ° WeVer : iD C > althou - 2h ** "~nd OR gate 906. is* provided by the output of an t^D 

interim update is to rematn at "strongly not taken", the 5 gate 911. which receives a first input from the co CT «wntoo 

resolution is ^aken thus the resolved prediction at the end output of the selection circuitry 512 andT econd S S 

of cycle C, is "weaJdy not taken", i.e.' once aaain changing an input line 912 seconainput Horn 

Sf^ 6 !; 8 ^ 0 !! 116 P redicti o D -. rather *an prediction As previously mentioned, the write enable line 901 is 

itself. This should be contrasted with cycle C4. in which the operative to allow an address input over w£te buT^O to be 

predicted value of weakly not taken", at the start of the 10 stored in the CAM cell 510,. At all timeTone Imfin -acn 

cycle is updated to "strongly not taken", but as a result of a partition comprising a CAM cell aliieo >Z IrZ £ £ 

resolution to "taken" is finally conected to "weakly taken". RAM and S^S^i^^SS^^ 

tion ^Jl 3 gC ° f PrCdiCti0n ValUe - DOt ° f Predic " ^M^«tov^^Iti, tolJS? 

J. j- * ^ . a new branch J um P instruction is written. To render the 

The above discussion of the branch target buffer generally 15 partition line shown in FIG. 8 invalid, an input is a D o£ d to 

relates to the circuitry in operation of the buffer in the control input 904. so as to set the latoh 1SL iffiSS 

reading mode. Detatis of the circuitry for writing informa- conductive the pull down transistor 902, Tfie eie« of tiris 

Uonto an exemplary line of the branch target buffer will now is to pull the CAM output lineSll. down to e£in Toe CAM 

be described with reference to FIG. 8. FIG. 8 shows an output line is then held at earth until a newScted v£ue 

exemplary content addressable memory cell 510.. being one 20 is written into this line and the line is ma^keS Sd as £Z 

of the memory cells 510 in the first partition 401. This described. 

S££???** ^T 17 - 51 °* f ° r * e pUipose of To write a new P«dicted-taken entrv to the line shown in 

illustration is shown as having four storage locations for FIG. 8. it is necessary to write information to me cS«» nt 

stonng a four bit instruction word address. The content addressable memor/cell 510.. to the *L ad*es1 row 

addressable memory coll 510. has an output line 511, upon 25 518. to the prediction store 563. and the pSofslenS 

which there appears a logical 1 "CAM hit" when an address store S71, In the present embodiment theSction fofa 

S * C r^^n W ° rd addrMS topUt tUS 500 fiDdS new ^^red branch is "taken" and the preoiction stren^ 

^ n .l t i-^r"- v is selected ,0 * ""^i *«» overall prediction levefS 

The output kne 511. passes to the previously-described "wtakly-taken". To accomplish this, the circuit 112 respondl 

scfcction circuifi? 512 to an output Ike 513, In the writing 30 to an entry command from the process^ U0 to oauSS 

mode input 912 changes state from the reading mode so thai first input kne'907 to supply a logical 1 and the second input 

deselection circuitry 512 provides a connection between its line 909 to supply a logical 1. When the write enaWeTe 

mput and output where the partition is to be written to. but 901. goestoloScal l.the outoutXefeco^OR ^ 9W 

for ak other partitions the circuit 512 provides no connec- goes to logical 1. thus enabling the transmission £s 918* 

tion between input and output and those outputs are held to 35 and 910. and causing the logical 1 on line 907 to be written 

fZ^t c ?««t addressable memory cells are not only to prediction store register stage 563. (representing "taken") 

connected to instruction word address input bus 500. but and the logical 1 on second input line 909 to be written to 

alsotomstnictionwordwritebus900.Tnisbusisconnected the prediction strength stage 571. (repVesentu^TealA 

t S ?L age 1 locat, ? ns °! content addressable The address data presented on the write bus 900 is written 

memory. Each of &e storage locations of the content adores- 40 to the CAM cell 510, as has previously been mentioned. A 

sable memory 510. is also connected to a write enable line logical 1 applied to line 920 ensures that OR oate 590 

901. which receives a logical 1 when the address currently outputs a logical 1. thus rendering conductive the fransmis- 

appeanng on the write bus 900 is to be written to the content sion gates 530. The presence of the logical 1 on the write 

jfJSS? f Hn<l 7i? CU W SCpaiate WrfK eMbIe enable m »- P rovided at * e **«to the first OR oate 

is provided for each line of each partition. A further control 45 905,however provides a logical 1 on the word line 515 to 

line 920 connected to all partition lines is provided to allow the RAM line 518. and this allows target address data to be 

3 ^? £55 address ,!° * ^ rittM to the RAM 518. written into the corresponding line 518. of the RAM 518 

ine LAM output line 511. is connected to a pull down from a taiget address input signal 90S 

transistor 902, operable to pull the CAM output line to To ensure that there is always an empty line available for 

I' 7? C FiL*""!. transistor W2 » is controlled by a 50 each partition, every time a new branch is entered into the 

control latch 903, which is selectively set or reset by an branch target buffer a random number generator 113 ran- 

input 904. to mark the line valid or invalid The write enable domly selects a line in the partition which Is to contain the 

one * connected to one input of a two-input OR new branch and inputs a bit to the corresponding latch 903 

S *i!m * $C 0thCr inpUt h P rovided b y * e o^" 1 of «« so foat the line is marked invalid. The line to°which the 

AND gate 564. of the prediction processing circuitry 514 55 branch instruction is written is selected usin» a stored value 

and the output of the OR gate 905. forms the word line 515. of the previously-selected random number. This technique 

for row b of the RAM 518. The write enable line 901, is requires a first row decoder 115 to output signal 904 to mark 

fiirther connected to one input of a two-input OR gate 906, a line as invalid and a second row decoder 114 to output 

Register entry 563, of the prediction store 563 is connected signal 901 to add the new branch. The random number 

• ,0 i a. Pfed^uon , change .enable, .knf m allowing..a new. 60 .generator .113 is eonfiguredto ensure that the same number 

prediction to be written in from the circuit 102" via a is not generated twice successively.-' :v - : ->' "■ ■■■'•■.:- 

n-ansmission gate 918, The output of the prediction store FIG. 9 shows schematically a computer system including 

563, is coupled via an OR gate 565, to the AND gate 564, the branch target buffer 400 already described. A processor 

The second input of the OR gate 565, is provided by the 110 is arranged to fetch instructions from a memory 111 and 

controlline 920. Prediction strength storage Iocation571, is 65 execute aplurality of instructions in a pipelined process The 

connected to an input line 909 allowing a new prediction fetch address is output on line 120 and this full instruction 

strength to be written in from state transition circuitry 112. address is supplied to the memory 111 and at the same time 
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through a multiplexor 116 to the inputs 500 and 501 of the ,• ^ 

buffer 400. The fetched instructions are supplied on Ifae m of associatIve "suited in no hit. then the cause 

to the processor for execution and the results of a read 1£ g ' n ! une * new and the line to 

operation in the buffer 400 are fed on lines 504 505 and 507 information is being written is marked as valid. In this 

to the processor 110 as previously described At an appro- < °, T S ? 13is H ^ so an output 923 of an OR 

pnate stage in the processing pipeline, the processor UOwfll f C , 922 15 !° w : ^ e 923 foims a ° input to latch 903. so as 

resolve each branch instruction to determine whether it is in 1° r * C , la< : ch for the Dew ^ Reeled by the high signal 

fact taken. If the processor had received from buffer 400 a ° hne m * ,hereb 3 r rnarting that line as valid Si°nal 903 

correct prediction for the branch instruction then no write causes latch 903 * t0 respond to signal on lme 904 < .°to mark 

operation is required. If however no correct prediction had a new ^ 33 invaJid - ' ' 

£n n ! !!^f d i dUe '° d * er abse0Ce of the braDch in struc- 10 ? 3 W ° n§ P rediction 1"5 teen made, then the automatic 

nr™ b f °- 3 ™ s P redic ft"» by the buffer) then the Properties of the described brimch target bSKj 

Processor outputs signals to the circuit 112 to cause a write. result « a wrong value of interim update offiction bS 

The outpu on hnes 122 comprises the result of resolution. stored Reference should be made te^GS TanS anS 

is responsive to 'the *^^5£kH^^ 20 SS^ - 1 Potion was "strong 

toel22tochangepredicUonorstreng P thoSedi^ 20 !j'° ed °' *en if the branch were 

in accordance with the resolution of the bfanch £S r!^ a °$°T * e Stren 8* bit ™« be 

by the processor 110 as has previously been ETKf Acco 1 rdm Sl>'. referring to FIG. 8 the third input 

reference to FIGS. 6 and 7. The updated prediction and %L?J* P ™ Ut * !*! of a »«" prediction, and 

strength of prediction values are input on lines 907 and 909 P red f ca . on strength bit mio the corresponding line of the 

to the appropriate entry in the buffer 400. together with the P rcdlcu0 V.'°re s « and me prediction strength store 571 

other data to be written, including the target address on line For of the value of prediction, or prediction 

90S. as already described with reference to FIG. 8. In this s1iea S^- the write enable line 901. is kept at a logical 0 thus 

case the output bus 900 outputs the full instruction address Preventing writing of a new address into the CAM cell or 

■i?*?^ f0f *S %Tite oP«ration into the buffer and f new «l« addrc ss into the RAM 518. and instead the third 

9?2 WUTT "• ^ ™ pUt t0 multi P kx °r 116. Line 30 "f" 1 1206 912 istake » «> ^cal l.The presence of a logical 

*7™,itwl frresponding to a write operation and 1 (corresponding to the match between the input instruction 

tiie mulupkxor 116 selects the address from line 900 to input address on bus 500 and the information stored fa theTAM 

EStiS** 501 -° ° Ut " aSS0Ciate °^on P on ' cell 510.) provides a losical iTtfe sS ta? 5*f 

instruction addresf as these are fed ftSE r m new prediction value to be input over line 907. and the new 

510-S1C In FIG. 9 the nSp£at U6 S ^"eS* ^ "£ T" ° Ver «* 9 ° 9 " 
from the address line 120. a s^ond input 125 from Ifae 900 <o nrSrSn^^ 1 " ? ? aibed With res P ect t0 HG - 3- no 
and a control input 126 from line 912. fa wc! write ff!"«f ed 113 ^ where a write is being 

operation the branch target bufferreceives an fapntrromjhe S ? C n ° h tM * et ""^^ *e branch target 
state transition circuitry 112 attempting to correct the con- I produces no target address nor prediction output 
tent of the buffer without distinguishing 'whether the error is • a PP reciated ** ** above example provides 

due to a misprediction or to the absence of the branch « I P reveDtin S a double entry in the buffer for the 

instruction in the buffer. Accordingly, the embodiment per- Same * ulstrucUon: II *™ be understood that due to the 
forms the same remedial action in both situations as follows- P r( f essor Pipeline, there wfll be delay between a new branch 
1. Input the instruction address to the branch target buffer S2S^SS S. S P f° CCSSOr 3nd f ° Und 

for an associative match. At the same time begin to SSJ^i?* * C ^ rCSOlUtiOD of &e branch 

write to the invalid line in the branch Urgrt Sffer » \^ PtQCeSS< * Which WOuld **» result » a 

information as if the error had occurred SabseS S-^ 8 k*J s ™*™M* oircuitry 112 to cause a 
of the branch instruction in the buffer. If thepreS Sft™ ^5 ^° U ^ * " BBUktt >- il is P ossible 
on line 907 is taken, then line 920 is set toCcT to StS* "*"Wion could be fetched a second 

allow the new target address to be written, toffi ca e £S2rlT K SSOT ^tf! ** CXCCUtion of 1,16 branch 
signal 920 being 1 provides a logical 1 iwut to /SS « A beea / e$oh -«i and caused a write entry into 

gateS64,IfaCAlw,occ«r;tomiS^ * S^^^" 0 avoid the possibSty of 
signal513 6 issettologicall and consequentiy OR gate ITlnL^ J ^ ^trucuon into the buffer at a 
905. provides a logical 1 on wordline 515 of £ ^ a ^. n when *%^struction was not located in the 

RAM. In addition, line 920 causes circuit 506 to put Se 5f t WaS ^ d,ed by * C prOCessor ' 11 " for 

.. . .,ne.w4arset address Aom input 908 onto the RAM hit en ^ • T ™ associam ' e match on the branch address 
• lutesSOJsothat'^ 

that row of the RAM 0n nne 90 °- ^ use ° r,b « miilUplex6r 116 ensured itfiatW ' 

2. If the associative match aeates a hit this indicates th.r V** f addrCSS /4 0n u bus 900 b used simultaneously for both 
the branch had already been looted fa bv t b^h FT" ?± ° perati ° n is Caffied M ^ 

targe, buffer. As a result til ?Lw) 52 65 S SS rS^hen^ ^ ^.^^ 

entry is terminated by leaving the JcTZ Z ,? / ^ * CD cntty 11116 1S maintained as 

invalid M inv ? Ld and a hns 00 whi eh the hit occurred has a modifi- 

cation of the prediction strength and target which have been 



